Overview
Brought to you by YData
Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 105542 |
| Missing cells | 416 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 20.1 MiB |
| Average record size in memory | 200.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Text | 5 |
| Categorical | 10 |
article_id is highly overall correlated with product_code | High correlation |
colour_group_code is highly overall correlated with colour_group_name and 1 other fields | High correlation |
colour_group_name is highly overall correlated with colour_group_code and 5 other fields | High correlation |
department_no is highly overall correlated with index_code and 3 other fields | High correlation |
garment_group_name is highly overall correlated with garment_group_no and 1 other fields | High correlation |
garment_group_no is highly overall correlated with garment_group_name and 1 other fields | High correlation |
graphical_appearance_name is highly overall correlated with graphical_appearance_no | High correlation |
graphical_appearance_no is highly overall correlated with colour_group_name and 2 other fields | High correlation |
index_code is highly overall correlated with department_no and 4 other fields | High correlation |
index_group_name is highly overall correlated with department_no and 4 other fields | High correlation |
index_group_no is highly overall correlated with department_no and 4 other fields | High correlation |
index_name is highly overall correlated with department_no and 4 other fields | High correlation |
perceived_colour_master_id is highly overall correlated with colour_group_name and 1 other fields | High correlation |
perceived_colour_master_name is highly overall correlated with colour_group_code and 4 other fields | High correlation |
perceived_colour_value_id is highly overall correlated with colour_group_name and 2 other fields | High correlation |
perceived_colour_value_name is highly overall correlated with colour_group_name and 3 other fields | High correlation |
product_code is highly overall correlated with article_id | High correlation |
product_group_name is highly overall correlated with garment_group_name and 2 other fields | High correlation |
product_type_no is highly overall correlated with product_group_name | High correlation |
section_no is highly overall correlated with index_code and 3 other fields | High correlation |
graphical_appearance_no is highly skewed (γ1 = -45.01901161) | Skewed |
article_id has unique values | Unique |
Reproduction
| Analysis started | 2025-11-20 16:47:04.208986 |
|---|---|
| Analysis finished | 2025-11-20 16:47:31.831505 |
| Duration | 27.62 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
article_id
Real number (ℝ)
High correlation Unique
| Distinct | 105542 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.9842457 × 108 |
| Minimum | 1.0877502 × 108 |
|---|---|
| Maximum | 9.59461 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 824.7 KiB |
Quantile statistics
| Minimum | 1.0877502 × 108 |
|---|---|
| 5-th percentile | 4.9381002 × 108 |
| Q1 | 6.169925 × 108 |
| median | 7.02213 × 108 |
| Q3 | 7.96703 × 108 |
| 95-th percentile | 8.8937901 × 108 |
| Maximum | 9.59461 × 108 |
| Range | 8.5068599 × 108 |
| Interquartile range (IQR) | 1.797105 × 108 |
Descriptive statistics
| Standard deviation | 1.2846238 × 108 |
|---|---|
| Coefficient of variation (CV) | 0.18393165 |
| Kurtosis | 0.66097576 |
| Mean | 6.9842457 × 108 |
| Median Absolute Deviation (MAD) | 90074996 |
| Skewness | -0.57728335 |
| Sum | 7.3713126 × 1013 |
| Variance | 1.6502583 × 1016 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 959461001 | 1 | < 0.1% |
| 108775015 | 1 | < 0.1% |
| 108775044 | 1 | < 0.1% |
| 108775051 | 1 | < 0.1% |
| 110065001 | 1 | < 0.1% |
| 110065002 | 1 | < 0.1% |
| 946527001 | 1 | < 0.1% |
| 946748001 | 1 | < 0.1% |
| 946748003 | 1 | < 0.1% |
| 946748004 | 1 | < 0.1% |
| Other values (105532) | 105532 |
| Value | Count | Frequency (%) |
| 108775015 | 1 | |
| 108775044 | 1 | |
| 108775051 | 1 | |
| 110065001 | 1 | |
| 110065002 | 1 | |
| 110065011 | 1 | |
| 111565001 | 1 | |
| 111565003 | 1 | |
| 111586001 | 1 | |
| 111593001 | 1 |
| Value | Count | Frequency (%) |
| 959461001 | 1 | |
| 957375001 | 1 | |
| 956217002 | 1 | |
| 953763001 | 1 | |
| 953450001 | 1 | |
| 952938001 | 1 | |
| 952937003 | 1 | |
| 952267001 | 1 | |
| 950449002 | 1 | |
| 949594001 | 1 |
product_code
Real number (ℝ)
High correlation
| Distinct | 47224 |
|---|---|
| Distinct (%) | 44.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 698424.56 |
| Minimum | 108775 |
|---|---|
| Maximum | 959461 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 824.7 KiB |
Quantile statistics
| Minimum | 108775 |
|---|---|
| 5-th percentile | 493810 |
| Q1 | 616992.5 |
| median | 702213 |
| Q3 | 796703 |
| 95-th percentile | 889379 |
| Maximum | 959461 |
| Range | 850686 |
| Interquartile range (IQR) | 179710.5 |
Descriptive statistics
| Standard deviation | 128462.38 |
|---|---|
| Coefficient of variation (CV) | 0.18393165 |
| Kurtosis | 0.66097587 |
| Mean | 698424.56 |
| Median Absolute Deviation (MAD) | 90075 |
| Skewness | -0.57728339 |
| Sum | 7.3713125 × 1010 |
| Variance | 1.6502584 × 1010 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 783707 | 75 | 0.1% |
| 684021 | 70 | 0.1% |
| 699923 | 52 | < 0.1% |
| 699755 | 49 | < 0.1% |
| 685604 | 46 | < 0.1% |
| 739659 | 44 | < 0.1% |
| 685816 | 41 | < 0.1% |
| 664074 | 41 | < 0.1% |
| 570002 | 41 | < 0.1% |
| 562245 | 41 | < 0.1% |
| Other values (47214) | 105042 |
| Value | Count | Frequency (%) |
| 108775 | 3 | |
| 110065 | 3 | |
| 111565 | 2 | < 0.1% |
| 111586 | 1 | < 0.1% |
| 111593 | 1 | < 0.1% |
| 111609 | 1 | < 0.1% |
| 112679 | 2 | < 0.1% |
| 114428 | 2 | < 0.1% |
| 116379 | 1 | < 0.1% |
| 118458 | 7 |
| Value | Count | Frequency (%) |
| 959461 | 1 | |
| 957375 | 1 | |
| 956217 | 1 | |
| 953763 | 1 | |
| 953450 | 1 | |
| 952938 | 1 | |
| 952937 | 1 | |
| 952267 | 1 | |
| 950449 | 1 | |
| 949594 | 1 |
prod_name
Text
| Distinct | 45875 |
|---|---|
| Distinct (%) | 43.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 824.7 KiB |
Length
| Max length | 30 |
|---|---|
| Median length | 23 |
| Mean length | 15.535569 |
| Min length | 1 |
Unique
| Unique | 22920 ? |
|---|---|
| Unique (%) | 21.7% |
Sample
| 1st row | Strap top |
|---|---|
| 2nd row | Strap top |
| 3rd row | Strap top (1) |
| 4th row | OP T-shirt (Idro) |
| 5th row | OP T-shirt (Idro) |
| Value | Count | Frequency (%) |
| dress | 7825 | 2.6% |
| tee | 4553 | 1.5% |
| top | 3938 | 1.3% |
| shorts | 3555 | 1.2% |
| fancy | 2796 | 0.9% |
| ls | 2336 | 0.8% |
| hood | 2294 | 0.8% |
| sb | 2252 | 0.8% |
| set | 2133 | 0.7% |
| 1 | 2043 | 0.7% |
| Other values (13649) | 261891 |
Most occurring characters
| Value | Count | Frequency (%) |
| 190600 | 11.6% | |
| e | 116144 | 7.1% |
| a | 94570 | 5.8% |
| s | 79849 | 4.9% |
| r | 78145 | 4.8% |
| i | 76131 | 4.6% |
| o | 67798 | 4.1% |
| n | 65393 | 4.0% |
| t | 63950 | 3.9% |
| l | 58420 | 3.6% |
| Other values (81) | 748655 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1639655 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 190600 | 11.6% | |
| e | 116144 | 7.1% |
| a | 94570 | 5.8% |
| s | 79849 | 4.9% |
| r | 78145 | 4.8% |
| i | 76131 | 4.6% |
| o | 67798 | 4.1% |
| n | 65393 | 4.0% |
| t | 63950 | 3.9% |
| l | 58420 | 3.6% |
| Other values (81) | 748655 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1639655 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 190600 | 11.6% | |
| e | 116144 | 7.1% |
| a | 94570 | 5.8% |
| s | 79849 | 4.9% |
| r | 78145 | 4.8% |
| i | 76131 | 4.6% |
| o | 67798 | 4.1% |
| n | 65393 | 4.0% |
| t | 63950 | 3.9% |
| l | 58420 | 3.6% |
| Other values (81) | 748655 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1639655 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 190600 | 11.6% | |
| e | 116144 | 7.1% |
| a | 94570 | 5.8% |
| s | 79849 | 4.9% |
| r | 78145 | 4.8% |
| i | 76131 | 4.6% |
| o | 67798 | 4.1% |
| n | 65393 | 4.0% |
| t | 63950 | 3.9% |
| l | 58420 | 3.6% |
| Other values (81) | 748655 |
product_type_no
Real number (ℝ)
High correlation
| Distinct | 132 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 234.86187 |
| Minimum | -1 |
|---|---|
| Maximum | 762 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 121 |
| Negative (%) | 0.1% |
| Memory size | 824.7 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 70 |
| Q1 | 252 |
| median | 259 |
| Q3 | 272 |
| 95-th percentile | 304 |
| Maximum | 762 |
| Range | 763 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 75.049308 |
|---|---|
| Coefficient of variation (CV) | 0.31954658 |
| Kurtosis | 1.1655822 |
| Mean | 234.86187 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -1.4230313 |
| Sum | 24787792 |
| Variance | 5632.3986 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 272 | 11169 | 10.6% |
| 265 | 10362 | 9.8% |
| 252 | 9302 | 8.8% |
| 255 | 7904 | 7.5% |
| 254 | 4155 | 3.9% |
| 258 | 3979 | 3.8% |
| 262 | 3940 | 3.7% |
| 274 | 3939 | 3.7% |
| 259 | 3405 | 3.2% |
| 253 | 2991 | 2.8% |
| Other values (122) | 44396 |
| Value | Count | Frequency (%) |
| -1 | 121 | 0.1% |
| 49 | 48 | < 0.1% |
| 57 | 662 | |
| 59 | 1307 | |
| 60 | 50 | < 0.1% |
| 66 | 1280 | |
| 67 | 458 | 0.4% |
| 68 | 180 | 0.2% |
| 69 | 573 | |
| 70 | 1159 |
| Value | Count | Frequency (%) |
| 762 | 3 | < 0.1% |
| 761 | 5 | < 0.1% |
| 532 | 3 | < 0.1% |
| 529 | 4 | < 0.1% |
| 525 | 1 | < 0.1% |
| 523 | 2 | < 0.1% |
| 521 | 7 | < 0.1% |
| 515 | 6 | < 0.1% |
| 514 | 1 | < 0.1% |
| 512 | 24 |
product_type_name
Text
| Distinct | 131 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 824.7 KiB |
Length
| Max length | 24 |
|---|---|
| Median length | 19 |
| Mean length | 7.5308787 |
| Min length | 3 |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Vest top |
|---|---|
| 2nd row | Vest top |
| 3rd row | Vest top |
| 4th row | Bra |
| 5th row | Bra |
| Value | Count | Frequency (%) |
| trousers | 11299 | 9.2% |
| dress | 10362 | 8.5% |
| sweater | 9302 | 7.6% |
| top | 8142 | 6.7% |
| t-shirt | 7904 | 6.5% |
| bottom | 4275 | 3.5% |
| blouse | 3979 | 3.3% |
| jacket | 3940 | 3.2% |
| shorts | 3939 | 3.2% |
| shirt | 3854 | 3.1% |
| Other values (140) | 55357 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 87702 | 11.0% |
| r | 86934 | 10.9% |
| s | 86754 | 10.9% |
| t | 65600 | 8.3% |
| o | 51144 | 6.4% |
| a | 50695 | 6.4% |
| i | 40748 | 5.1% |
| S | 29213 | 3.7% |
| T | 25917 | 3.3% |
| u | 23025 | 2.9% |
| Other values (41) | 247092 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 794824 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 87702 | 11.0% |
| r | 86934 | 10.9% |
| s | 86754 | 10.9% |
| t | 65600 | 8.3% |
| o | 51144 | 6.4% |
| a | 50695 | 6.4% |
| i | 40748 | 5.1% |
| S | 29213 | 3.7% |
| T | 25917 | 3.3% |
| u | 23025 | 2.9% |
| Other values (41) | 247092 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 794824 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 87702 | 11.0% |
| r | 86934 | 10.9% |
| s | 86754 | 10.9% |
| t | 65600 | 8.3% |
| o | 51144 | 6.4% |
| a | 50695 | 6.4% |
| i | 40748 | 5.1% |
| S | 29213 | 3.7% |
| T | 25917 | 3.3% |
| u | 23025 | 2.9% |
| Other values (41) | 247092 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 794824 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 87702 | 11.0% |
| r | 86934 | 10.9% |
| s | 86754 | 10.9% |
| t | 65600 | 8.3% |
| o | 51144 | 6.4% |
| a | 50695 | 6.4% |
| i | 40748 | 5.1% |
| S | 29213 | 3.7% |
| T | 25917 | 3.3% |
| u | 23025 | 2.9% |
| Other values (41) | 247092 |
product_group_name
Categorical
High correlation
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 824.7 KiB |
| Garment Upper body | |
|---|---|
| Garment Lower body | |
| Garment Full body | |
| Accessories | |
| Underwear | |
| Other values (14) |
Length
| Max length | 21 |
|---|---|
| Median length | 18 |
| Mean length | 15.44064 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Garment Upper body |
|---|---|
| 2nd row | Garment Upper body |
| 3rd row | Garment Upper body |
| 4th row | Underwear |
| 5th row | Underwear |
Common Values
| Value | Count | Frequency (%) |
| Garment Upper body | 42741 | |
| Garment Lower body | 19812 | |
| Garment Full body | 13292 | 12.6% |
| Accessories | 11158 | 10.6% |
| Underwear | 5490 | 5.2% |
| Shoes | 5283 | 5.0% |
| Swimwear | 3127 | 3.0% |
| Socks & Tights | 2442 | 2.3% |
| Nightwear | 1899 | 1.8% |
| Unknown | 121 | 0.1% |
| Other values (9) | 177 | 0.2% |
Length
| Value | Count | Frequency (%) |
| garment | 75854 | |
| body | 75845 | |
| upper | 42741 | |
| lower | 19812 | 7.6% |
| full | 13292 | 5.1% |
| accessories | 11158 | 4.3% |
| underwear | 5490 | 2.1% |
| shoes | 5283 | 2.0% |
| swimwear | 3127 | 1.2% |
| socks | 2442 | 0.9% |
| Other values (16) | 7102 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 182285 | 11.2% |
| r | 165779 | 10.2% |
| 156604 | 9.6% | |
| o | 114727 | 7.0% |
| a | 86526 | 5.3% |
| p | 85482 | 5.2% |
| n | 81847 | 5.0% |
| d | 81398 | 5.0% |
| t | 80347 | 4.9% |
| m | 79047 | 4.9% |
| Other values (25) | 515594 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1629636 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 182285 | 11.2% |
| r | 165779 | 10.2% |
| 156604 | 9.6% | |
| o | 114727 | 7.0% |
| a | 86526 | 5.3% |
| p | 85482 | 5.2% |
| n | 81847 | 5.0% |
| d | 81398 | 5.0% |
| t | 80347 | 4.9% |
| m | 79047 | 4.9% |
| Other values (25) | 515594 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1629636 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 182285 | 11.2% |
| r | 165779 | 10.2% |
| 156604 | 9.6% | |
| o | 114727 | 7.0% |
| a | 86526 | 5.3% |
| p | 85482 | 5.2% |
| n | 81847 | 5.0% |
| d | 81398 | 5.0% |
| t | 80347 | 4.9% |
| m | 79047 | 4.9% |
| Other values (25) | 515594 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1629636 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 182285 | 11.2% |
| r | 165779 | 10.2% |
| 156604 | 9.6% | |
| o | 114727 | 7.0% |
| a | 86526 | 5.3% |
| p | 85482 | 5.2% |
| n | 81847 | 5.0% |
| d | 81398 | 5.0% |
| t | 80347 | 4.9% |
| m | 79047 | 4.9% |
| Other values (25) | 515594 |
graphical_appearance_no
Real number (ℝ)
High correlation Skewed
| Distinct | 30 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1009515.1 |
| Minimum | -1 |
|---|---|
| Maximum | 1010029 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 52 |
| Negative (%) | < 0.1% |
| Memory size | 824.7 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 1010001 |
| Q1 | 1010008 |
| median | 1010016 |
| Q3 | 1010016 |
| 95-th percentile | 1010023 |
| Maximum | 1010029 |
| Range | 1010030 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 22413.586 |
|---|---|
| Coefficient of variation (CV) | 0.022202329 |
| Kurtosis | 2024.75 |
| Mean | 1009515.1 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -45.019012 |
| Sum | 1.0654624 × 1011 |
| Variance | 5.0236883 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1010016 | 49747 | |
| 1010001 | 17165 | 16.3% |
| 1010010 | 5938 | 5.6% |
| 1010017 | 4990 | 4.7% |
| 1010023 | 4842 | 4.6% |
| 1010008 | 3215 | 3.0% |
| 1010014 | 3098 | 2.9% |
| 1010004 | 2178 | 2.1% |
| 1010005 | 1830 | 1.7% |
| 1010021 | 1513 | 1.4% |
| Other values (20) | 11026 | 10.4% |
| Value | Count | Frequency (%) |
| -1 | 52 | < 0.1% |
| 1010001 | 17165 | |
| 1010002 | 1341 | 1.3% |
| 1010003 | 15 | < 0.1% |
| 1010004 | 2178 | 2.1% |
| 1010005 | 1830 | 1.7% |
| 1010006 | 681 | 0.6% |
| 1010007 | 1165 | 1.1% |
| 1010008 | 3215 | 3.0% |
| 1010009 | 958 | 0.9% |
| Value | Count | Frequency (%) |
| 1010029 | 8 | < 0.1% |
| 1010028 | 86 | 0.1% |
| 1010027 | 66 | 0.1% |
| 1010026 | 1502 | 1.4% |
| 1010025 | 153 | 0.1% |
| 1010024 | 322 | 0.3% |
| 1010023 | 4842 | |
| 1010022 | 830 | 0.8% |
| 1010021 | 1513 | 1.4% |
| 1010020 | 376 | 0.4% |
graphical_appearance_name
Categorical
High correlation
| Distinct | 30 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 824.7 KiB |
| Solid | |
|---|---|
| All over pattern | |
| Melange | |
| Stripe | |
| Denim | 4842 |
| Other values (25) |
Length
| Max length | 19 |
|---|---|
| Median length | 5 |
| Mean length | 8.2858578 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Solid |
|---|---|
| 2nd row | Solid |
| 3rd row | Stripe |
| 4th row | Solid |
| 5th row | Solid |
Common Values
| Value | Count | Frequency (%) |
| Solid | 49747 | |
| All over pattern | 17165 | 16.3% |
| Melange | 5938 | 5.6% |
| Stripe | 4990 | 4.7% |
| Denim | 4842 | 4.6% |
| Front print | 3215 | 3.0% |
| Placement print | 3098 | 2.9% |
| Check | 2178 | 2.1% |
| Colour blocking | 1830 | 1.7% |
| Lace | 1513 | 1.4% |
| Other values (20) | 11026 | 10.4% |
Length
| Value | Count | Frequency (%) |
| solid | 49747 | |
| pattern | 17680 | 11.7% |
| all | 17165 | 11.4% |
| over | 17165 | 11.4% |
| 6313 | 4.2% | |
| melange | 5938 | 3.9% |
| stripe | 4990 | 3.3% |
| denim | 4842 | 3.2% |
| front | 3215 | 2.1% |
| placement | 3098 | 2.0% |
| Other values (25) | 21011 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 102988 | |
| o | 80380 | 9.2% |
| e | 77881 | 8.9% |
| i | 77859 | 8.9% |
| t | 67513 | 7.7% |
| r | 62943 | 7.2% |
| S | 55696 | 6.4% |
| d | 54006 | 6.2% |
| n | 48443 | 5.5% |
| 45622 | 5.2% | |
| Other values (32) | 201175 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 874506 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 102988 | |
| o | 80380 | 9.2% |
| e | 77881 | 8.9% |
| i | 77859 | 8.9% |
| t | 67513 | 7.7% |
| r | 62943 | 7.2% |
| S | 55696 | 6.4% |
| d | 54006 | 6.2% |
| n | 48443 | 5.5% |
| 45622 | 5.2% | |
| Other values (32) | 201175 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 874506 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 102988 | |
| o | 80380 | 9.2% |
| e | 77881 | 8.9% |
| i | 77859 | 8.9% |
| t | 67513 | 7.7% |
| r | 62943 | 7.2% |
| S | 55696 | 6.4% |
| d | 54006 | 6.2% |
| n | 48443 | 5.5% |
| 45622 | 5.2% | |
| Other values (32) | 201175 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 874506 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 102988 | |
| o | 80380 | 9.2% |
| e | 77881 | 8.9% |
| i | 77859 | 8.9% |
| t | 67513 | 7.7% |
| r | 62943 | 7.2% |
| S | 55696 | 6.4% |
| d | 54006 | 6.2% |
| n | 48443 | 5.5% |
| 45622 | 5.2% | |
| Other values (32) | 201175 |
colour_group_code
Real number (ℝ)
High correlation
| Distinct | 50 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.233822 |
| Minimum | -1 |
|---|---|
| Maximum | 93 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 28 |
| Negative (%) | < 0.1% |
| Memory size | 824.7 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 9 |
| median | 14 |
| Q3 | 52 |
| 95-th percentile | 81 |
| Maximum | 93 |
| Range | 94 |
| Interquartile range (IQR) | 43 |
Descriptive statistics
| Standard deviation | 28.086154 |
|---|---|
| Coefficient of variation (CV) | 0.87132561 |
| Kurtosis | -1.0610471 |
| Mean | 32.233822 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.7138227 |
| Sum | 3402022 |
| Variance | 788.83205 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 22670 | |
| 73 | 12171 | 11.5% |
| 10 | 9542 | 9.0% |
| 51 | 5811 | 5.5% |
| 7 | 4487 | 4.3% |
| 12 | 3356 | 3.2% |
| 72 | 3308 | 3.1% |
| 42 | 3056 | 2.9% |
| 71 | 3012 | 2.9% |
| 19 | 2767 | 2.6% |
| Other values (40) | 35362 |
| Value | Count | Frequency (%) |
| -1 | 28 | < 0.1% |
| 1 | 105 | 0.1% |
| 2 | 31 | < 0.1% |
| 3 | 709 | 0.7% |
| 4 | 94 | 0.1% |
| 5 | 1377 | 1.3% |
| 6 | 2105 | 2.0% |
| 7 | 4487 | 4.3% |
| 8 | 2731 | 2.6% |
| 9 | 22670 |
| Value | Count | Frequency (%) |
| 93 | 2106 | 2.0% |
| 92 | 815 | 0.8% |
| 91 | 681 | 0.6% |
| 90 | 129 | 0.1% |
| 83 | 473 | 0.4% |
| 82 | 435 | 0.4% |
| 81 | 1027 | 1.0% |
| 80 | 14 | < 0.1% |
| 73 | 12171 | |
| 72 | 3308 | 3.1% |
colour_group_name
Categorical
High correlation
| Distinct | 50 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 824.7 KiB |
| Black | |
|---|---|
| Dark Blue | |
| White | |
| Light Pink | |
| Grey | 4487 |
| Other values (45) |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 7.4805101 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Black |
|---|---|
| 2nd row | White |
| 3rd row | Off White |
| 4th row | Black |
| 5th row | White |
Common Values
| Value | Count | Frequency (%) |
| Black | 22670 | |
| Dark Blue | 12171 | 11.5% |
| White | 9542 | 9.0% |
| Light Pink | 5811 | 5.5% |
| Grey | 4487 | 4.3% |
| Light Beige | 3356 | 3.2% |
| Blue | 3308 | 3.1% |
| Red | 3056 | 2.9% |
| Light Blue | 3012 | 2.9% |
| Greenish Khaki | 2767 | 2.6% |
| Other values (40) | 35362 |
Length
| Value | Count | Frequency (%) |
| dark | 23498 | |
| black | 22670 | |
| light | 19334 | |
| blue | 18542 | |
| white | 12268 | |
| pink | 9442 | 6.0% |
| grey | 9323 | 5.9% |
| beige | 7378 | 4.7% |
| red | 5795 | 3.7% |
| green | 3731 | 2.4% |
| Other values (16) | 25065 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 87703 | 11.1% |
| k | 58405 | 7.4% |
| i | 58311 | 7.4% |
| l | 54192 | 6.9% |
| a | 52335 | 6.6% |
| 51504 | 6.5% | |
| B | 50155 | 6.4% |
| r | 49945 | 6.3% |
| h | 40420 | 5.1% |
| t | 33220 | 4.2% |
| Other values (28) | 253318 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 789508 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 87703 | 11.1% |
| k | 58405 | 7.4% |
| i | 58311 | 7.4% |
| l | 54192 | 6.9% |
| a | 52335 | 6.6% |
| 51504 | 6.5% | |
| B | 50155 | 6.4% |
| r | 49945 | 6.3% |
| h | 40420 | 5.1% |
| t | 33220 | 4.2% |
| Other values (28) | 253318 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 789508 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 87703 | 11.1% |
| k | 58405 | 7.4% |
| i | 58311 | 7.4% |
| l | 54192 | 6.9% |
| a | 52335 | 6.6% |
| 51504 | 6.5% | |
| B | 50155 | 6.4% |
| r | 49945 | 6.3% |
| h | 40420 | 5.1% |
| t | 33220 | 4.2% |
| Other values (28) | 253318 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 789508 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 87703 | 11.1% |
| k | 58405 | 7.4% |
| i | 58311 | 7.4% |
| l | 54192 | 6.9% |
| a | 52335 | 6.6% |
| 51504 | 6.5% | |
| B | 50155 | 6.4% |
| r | 49945 | 6.3% |
| h | 40420 | 5.1% |
| t | 33220 | 4.2% |
| Other values (28) | 253318 |
perceived_colour_value_id
Real number (ℝ)
High correlation
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2061833 |
| Minimum | -1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 28 |
| Negative (%) | < 0.1% |
| Memory size | 824.7 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 8 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.5638389 |
|---|---|
| Coefficient of variation (CV) | 0.48775718 |
| Kurtosis | -0.094881985 |
| Mean | 3.2061833 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.27399945 |
| Sum | 338387 |
| Variance | 2.4455922 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 42706 | |
| 1 | 22152 | |
| 3 | 15739 | 14.9% |
| 2 | 12630 | 12.0% |
| 5 | 6471 | 6.1% |
| 7 | 5711 | 5.4% |
| 6 | 105 | 0.1% |
| -1 | 28 | < 0.1% |
| Value | Count | Frequency (%) |
| -1 | 28 | < 0.1% |
| 1 | 22152 | |
| 2 | 12630 | 12.0% |
| 3 | 15739 | 14.9% |
| 4 | 42706 | |
| 5 | 6471 | 6.1% |
| 6 | 105 | 0.1% |
| 7 | 5711 | 5.4% |
| Value | Count | Frequency (%) |
| 7 | 5711 | 5.4% |
| 6 | 105 | 0.1% |
| 5 | 6471 | 6.1% |
| 4 | 42706 | |
| 3 | 15739 | 14.9% |
| 2 | 12630 | 12.0% |
| 1 | 22152 | |
| -1 | 28 | < 0.1% |
perceived_colour_value_name
Categorical
High correlation
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 824.7 KiB |
| Dark | |
|---|---|
| Dusty Light | |
| Light | |
| Medium Dusty | |
| Bright | |
| Other values (3) |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 6.8123022 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Dark |
|---|---|
| 2nd row | Light |
| 3rd row | Dusty Light |
| 4th row | Dark |
| 5th row | Light |
Common Values
| Value | Count | Frequency (%) |
| Dark | 42706 | |
| Dusty Light | 22152 | |
| Light | 15739 | 14.9% |
| Medium Dusty | 12630 | 12.0% |
| Bright | 6471 | 6.1% |
| Medium | 5711 | 5.4% |
| Undefined | 105 | 0.1% |
| Unknown | 28 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| dark | 42706 | |
| light | 37891 | |
| dusty | 34782 | |
| medium | 18341 | |
| bright | 6471 | 4.6% |
| undefined | 105 | 0.1% |
| unknown | 28 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 79144 | |
| D | 77488 | |
| i | 62808 | 8.7% |
| u | 53123 | 7.4% |
| r | 49177 | 6.8% |
| g | 44362 | 6.2% |
| h | 44362 | 6.2% |
| k | 42734 | 5.9% |
| a | 42706 | 5.9% |
| L | 37891 | 5.3% |
| Other values (13) | 185189 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 718984 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 79144 | |
| D | 77488 | |
| i | 62808 | 8.7% |
| u | 53123 | 7.4% |
| r | 49177 | 6.8% |
| g | 44362 | 6.2% |
| h | 44362 | 6.2% |
| k | 42734 | 5.9% |
| a | 42706 | 5.9% |
| L | 37891 | 5.3% |
| Other values (13) | 185189 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 718984 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 79144 | |
| D | 77488 | |
| i | 62808 | 8.7% |
| u | 53123 | 7.4% |
| r | 49177 | 6.8% |
| g | 44362 | 6.2% |
| h | 44362 | 6.2% |
| k | 42734 | 5.9% |
| a | 42706 | 5.9% |
| L | 37891 | 5.3% |
| Other values (13) | 185189 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 718984 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 79144 | |
| D | 77488 | |
| i | 62808 | 8.7% |
| u | 53123 | 7.4% |
| r | 49177 | 6.8% |
| g | 44362 | 6.2% |
| h | 44362 | 6.2% |
| k | 42734 | 5.9% |
| a | 42706 | 5.9% |
| L | 37891 | 5.3% |
| Other values (13) | 185189 |
perceived_colour_master_id
Real number (ℝ)
High correlation
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.8079722 |
| Minimum | -1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 685 |
| Negative (%) | 0.6% |
| Memory size | 824.7 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 5 |
| Q3 | 11 |
| 95-th percentile | 19 |
| Maximum | 20 |
| Range | 21 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 5.376727 |
|---|---|
| Coefficient of variation (CV) | 0.68862015 |
| Kurtosis | -0.36204043 |
| Mean | 7.8079722 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.80137952 |
| Sum | 824069 |
| Variance | 28.909193 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 22585 | |
| 2 | 18469 | |
| 9 | 12665 | |
| 4 | 9403 | |
| 12 | 8924 | 8.5% |
| 18 | 5878 | 5.6% |
| 11 | 5657 | 5.4% |
| 19 | 3526 | 3.3% |
| 20 | 3181 | 3.0% |
| 8 | 3121 | 3.0% |
| Other values (10) | 12133 |
| Value | Count | Frequency (%) |
| -1 | 685 | 0.6% |
| 1 | 1223 | 1.2% |
| 2 | 18469 | |
| 3 | 2734 | 2.6% |
| 4 | 9403 | |
| 5 | 22585 | |
| 6 | 1100 | 1.0% |
| 7 | 1829 | 1.7% |
| 8 | 3121 | 3.0% |
| 9 | 12665 |
| Value | Count | Frequency (%) |
| 20 | 3181 | 3.0% |
| 19 | 3526 | 3.3% |
| 18 | 5878 | |
| 16 | 3 | < 0.1% |
| 15 | 2180 | 2.1% |
| 14 | 105 | 0.1% |
| 13 | 2269 | 2.1% |
| 12 | 8924 | |
| 11 | 5657 | |
| 10 | 5 | < 0.1% |
perceived_colour_master_name
Categorical
High correlation
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 824.7 KiB |
| Black | |
|---|---|
| Blue | |
| White | |
| Pink | |
| Grey | |
| Other values (15) |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 4.9246082 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Black |
|---|---|
| 2nd row | White |
| 3rd row | White |
| 4th row | Black |
| 5th row | White |
Common Values
| Value | Count | Frequency (%) |
| Black | 22585 | |
| Blue | 18469 | |
| White | 12665 | |
| Pink | 9403 | |
| Grey | 8924 | 8.5% |
| Red | 5878 | 5.6% |
| Beige | 5657 | 5.4% |
| Green | 3526 | 3.3% |
| Khaki green | 3181 | 3.0% |
| Yellow | 3121 | 3.0% |
| Other values (10) | 12133 |
Length
| Value | Count | Frequency (%) |
| black | 22585 | |
| blue | 18469 | |
| white | 12665 | |
| pink | 9403 | |
| grey | 8924 | 8.1% |
| green | 6715 | 6.1% |
| red | 5878 | 5.4% |
| beige | 5657 | 5.2% |
| khaki | 3181 | 2.9% |
| yellow | 3121 | 2.8% |
| Other values (11) | 13233 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 83082 | |
| l | 52912 | 10.2% |
| B | 48983 | 9.4% |
| k | 35854 | 6.9% |
| i | 33948 | 6.5% |
| a | 31780 | 6.1% |
| c | 23685 | 4.6% |
| r | 23571 | 4.5% |
| n | 23386 | 4.5% |
| u | 23335 | 4.5% |
| Other values (23) | 139217 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 519753 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 83082 | |
| l | 52912 | 10.2% |
| B | 48983 | 9.4% |
| k | 35854 | 6.9% |
| i | 33948 | 6.5% |
| a | 31780 | 6.1% |
| c | 23685 | 4.6% |
| r | 23571 | 4.5% |
| n | 23386 | 4.5% |
| u | 23335 | 4.5% |
| Other values (23) | 139217 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 519753 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 83082 | |
| l | 52912 | 10.2% |
| B | 48983 | 9.4% |
| k | 35854 | 6.9% |
| i | 33948 | 6.5% |
| a | 31780 | 6.1% |
| c | 23685 | 4.6% |
| r | 23571 | 4.5% |
| n | 23386 | 4.5% |
| u | 23335 | 4.5% |
| Other values (23) | 139217 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 519753 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 83082 | |
| l | 52912 | 10.2% |
| B | 48983 | 9.4% |
| k | 35854 | 6.9% |
| i | 33948 | 6.5% |
| a | 31780 | 6.1% |
| c | 23685 | 4.6% |
| r | 23571 | 4.5% |
| n | 23386 | 4.5% |
| u | 23335 | 4.5% |
| Other values (23) | 139217 |
department_no
Real number (ℝ)
High correlation
| Distinct | 299 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4532.7778 |
| Minimum | 1201 |
|---|---|
| Maximum | 9989 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 824.7 KiB |
Quantile statistics
| Minimum | 1201 |
|---|---|
| 5-th percentile | 1338 |
| Q1 | 1676 |
| median | 4222 |
| Q3 | 7389 |
| 95-th percentile | 8748 |
| Maximum | 9989 |
| Range | 8788 |
| Interquartile range (IQR) | 5713 |
Descriptive statistics
| Standard deviation | 2712.692 |
|---|---|
| Coefficient of variation (CV) | 0.59846128 |
| Kurtosis | -1.3964267 |
| Mean | 4532.7778 |
| Median Absolute Deviation (MAD) | 2556 |
| Skewness | 0.27135387 |
| Sum | 4.7839844 × 108 |
| Variance | 7358697.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7616 | 2032 | 1.9% |
| 1338 | 1921 | 1.8% |
| 8716 | 1874 | 1.8% |
| 4242 | 1839 | 1.7% |
| 7648 | 1488 | 1.4% |
| 1640 | 1429 | 1.4% |
| 1636 | 1402 | 1.3% |
| 1676 | 1359 | 1.3% |
| 1344 | 1354 | 1.3% |
| 1643 | 1339 | 1.3% |
| Other values (289) | 89505 |
| Value | Count | Frequency (%) |
| 1201 | 829 | |
| 1202 | 16 | < 0.1% |
| 1212 | 299 | 0.3% |
| 1222 | 238 | 0.2% |
| 1241 | 87 | 0.1% |
| 1244 | 667 | |
| 1310 | 251 | 0.2% |
| 1313 | 630 | |
| 1322 | 1206 | |
| 1334 | 864 |
| Value | Count | Frequency (%) |
| 9989 | 122 | 0.1% |
| 9986 | 513 | |
| 9985 | 579 | |
| 9984 | 236 | |
| 9020 | 33 | < 0.1% |
| 8956 | 363 | |
| 8917 | 421 | |
| 8888 | 269 | |
| 8852 | 281 | |
| 8815 | 21 | < 0.1% |
department_name
Text
| Distinct | 250 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 824.7 KiB |
Length
| Max length | 40 |
|---|---|
| Median length | 26 |
| Mean length | 13.140219 |
| Min length | 2 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Jersey Basic |
|---|---|
| 2nd row | Jersey Basic |
| 3rd row | Jersey Basic |
| 4th row | Clean Lingerie |
| 5th row | Clean Lingerie |
| Value | Count | Frequency (%) |
| jersey | 24170 | 10.5% |
| girl | 16349 | 7.1% |
| kids | 14307 | 6.2% |
| fancy | 13087 | 5.7% |
| boy | 11674 | 5.1% |
| young | 10428 | 4.5% |
| baby | 7973 | 3.5% |
| knitwear | 7498 | 3.2% |
| basic | 7078 | 3.1% |
| woven | 6640 | 2.9% |
| Other values (132) | 111638 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 142984 | 10.3% |
| s | 126069 | 9.1% |
| 125300 | 9.0% | |
| r | 110268 | 8.0% |
| i | 87155 | 6.3% |
| o | 77105 | 5.6% |
| a | 65051 | 4.7% |
| y | 61342 | 4.4% |
| n | 54902 | 4.0% |
| c | 42943 | 3.1% |
| Other values (50) | 493726 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1386845 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 142984 | 10.3% |
| s | 126069 | 9.1% |
| 125300 | 9.0% | |
| r | 110268 | 8.0% |
| i | 87155 | 6.3% |
| o | 77105 | 5.6% |
| a | 65051 | 4.7% |
| y | 61342 | 4.4% |
| n | 54902 | 4.0% |
| c | 42943 | 3.1% |
| Other values (50) | 493726 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1386845 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 142984 | 10.3% |
| s | 126069 | 9.1% |
| 125300 | 9.0% | |
| r | 110268 | 8.0% |
| i | 87155 | 6.3% |
| o | 77105 | 5.6% |
| a | 65051 | 4.7% |
| y | 61342 | 4.4% |
| n | 54902 | 4.0% |
| c | 42943 | 3.1% |
| Other values (50) | 493726 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1386845 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 142984 | 10.3% |
| s | 126069 | 9.1% |
| 125300 | 9.0% | |
| r | 110268 | 8.0% |
| i | 87155 | 6.3% |
| o | 77105 | 5.6% |
| a | 65051 | 4.7% |
| y | 61342 | 4.4% |
| n | 54902 | 4.0% |
| c | 42943 | 3.1% |
| Other values (50) | 493726 |
index_code
Categorical
High correlation
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 824.7 KiB |
| A | |
|---|---|
| D | |
| F | |
| H | |
| I | |
| Other values (5) |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | A |
| 3rd row | A |
| 4th row | B |
| 5th row | B |
Common Values
| Value | Count | Frequency (%) |
| A | 26001 | |
| D | 15149 | |
| F | 12553 | |
| H | 12007 | |
| I | 9214 | 8.7% |
| G | 8875 | 8.4% |
| C | 6961 | 6.6% |
| B | 6775 | 6.4% |
| J | 4615 | 4.4% |
| S | 3392 | 3.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 26001 | |
| d | 15149 | |
| f | 12553 | |
| h | 12007 | |
| i | 9214 | 8.7% |
| g | 8875 | 8.4% |
| c | 6961 | 6.6% |
| b | 6775 | 6.4% |
| j | 4615 | 4.4% |
| s | 3392 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 26001 | |
| D | 15149 | |
| F | 12553 | |
| H | 12007 | |
| I | 9214 | 8.7% |
| G | 8875 | 8.4% |
| C | 6961 | 6.6% |
| B | 6775 | 6.4% |
| J | 4615 | 4.4% |
| S | 3392 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 105542 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 26001 | |
| D | 15149 | |
| F | 12553 | |
| H | 12007 | |
| I | 9214 | 8.7% |
| G | 8875 | 8.4% |
| C | 6961 | 6.6% |
| B | 6775 | 6.4% |
| J | 4615 | 4.4% |
| S | 3392 | 3.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 105542 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 26001 | |
| D | 15149 | |
| F | 12553 | |
| H | 12007 | |
| I | 9214 | 8.7% |
| G | 8875 | 8.4% |
| C | 6961 | 6.6% |
| B | 6775 | 6.4% |
| J | 4615 | 4.4% |
| S | 3392 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 105542 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 26001 | |
| D | 15149 | |
| F | 12553 | |
| H | 12007 | |
| I | 9214 | 8.7% |
| G | 8875 | 8.4% |
| C | 6961 | 6.6% |
| B | 6775 | 6.4% |
| J | 4615 | 4.4% |
| S | 3392 | 3.2% |
index_name
Categorical
High correlation
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 824.7 KiB |
| Ladieswear | |
|---|---|
| Divided | |
| Menswear | |
| Children Sizes 92-140 | |
| Children Sizes 134-170 | |
| Other values (5) |
Length
| Max length | 30 |
|---|---|
| Median length | 21 |
| Mean length | 13.761725 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ladieswear |
|---|---|
| 2nd row | Ladieswear |
| 3rd row | Ladieswear |
| 4th row | Lingeries/Tights |
| 5th row | Lingeries/Tights |
Common Values
| Value | Count | Frequency (%) |
| Ladieswear | 26001 | |
| Divided | 15149 | |
| Menswear | 12553 | |
| Children Sizes 92-140 | 12007 | |
| Children Sizes 134-170 | 9214 | 8.7% |
| Baby Sizes 50-98 | 8875 | 8.4% |
| Ladies Accessories | 6961 | 6.6% |
| Lingeries/Tights | 6775 | 6.4% |
| Children Accessories, Swimwear | 4615 | 4.4% |
| Sport | 3392 | 3.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| sizes | 30096 | |
| ladieswear | 26001 | |
| children | 25836 | |
| divided | 15149 | |
| menswear | 12553 | |
| 92-140 | 12007 | 6.6% |
| accessories | 11576 | 6.4% |
| 134-170 | 9214 | 5.1% |
| 50-98 | 8875 | 4.9% |
| baby | 8875 | 4.9% |
| Other values (4) | 21743 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 196467 | 13.5% |
| i | 155708 | 10.7% |
| s | 123889 | 8.5% |
| r | 90748 | 6.2% |
| d | 89096 | 6.1% |
| a | 85006 | 5.9% |
| 76383 | 5.3% | |
| w | 47784 | 3.3% |
| n | 45164 | 3.1% |
| L | 39737 | 2.7% |
| Other values (31) | 502458 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1452440 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 196467 | 13.5% |
| i | 155708 | 10.7% |
| s | 123889 | 8.5% |
| r | 90748 | 6.2% |
| d | 89096 | 6.1% |
| a | 85006 | 5.9% |
| 76383 | 5.3% | |
| w | 47784 | 3.3% |
| n | 45164 | 3.1% |
| L | 39737 | 2.7% |
| Other values (31) | 502458 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1452440 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 196467 | 13.5% |
| i | 155708 | 10.7% |
| s | 123889 | 8.5% |
| r | 90748 | 6.2% |
| d | 89096 | 6.1% |
| a | 85006 | 5.9% |
| 76383 | 5.3% | |
| w | 47784 | 3.3% |
| n | 45164 | 3.1% |
| L | 39737 | 2.7% |
| Other values (31) | 502458 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1452440 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 196467 | 13.5% |
| i | 155708 | 10.7% |
| s | 123889 | 8.5% |
| r | 90748 | 6.2% |
| d | 89096 | 6.1% |
| a | 85006 | 5.9% |
| 76383 | 5.3% | |
| w | 47784 | 3.3% |
| n | 45164 | 3.1% |
| L | 39737 | 2.7% |
| Other values (31) | 502458 |
index_group_no
Categorical
High correlation
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 824.7 KiB |
| 1 | |
|---|---|
| 4 | |
| 2 | |
| 3 | |
| 26 | 3392 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0321389 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 39737 | |
| 4 | 34711 | |
| 2 | 15149 | 14.4% |
| 3 | 12553 | 11.9% |
| 26 | 3392 | 3.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 39737 | |
| 4 | 34711 | |
| 2 | 15149 | 14.4% |
| 3 | 12553 | 11.9% |
| 26 | 3392 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 39737 | |
| 4 | 34711 | |
| 2 | 18541 | |
| 3 | 12553 | 11.5% |
| 6 | 3392 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 108934 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 39737 | |
| 4 | 34711 | |
| 2 | 18541 | |
| 3 | 12553 | 11.5% |
| 6 | 3392 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 108934 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 39737 | |
| 4 | 34711 | |
| 2 | 18541 | |
| 3 | 12553 | 11.5% |
| 6 | 3392 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 108934 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 39737 | |
| 4 | 34711 | |
| 2 | 18541 | |
| 3 | 12553 | 11.5% |
| 6 | 3392 | 3.1% |
index_group_name
Categorical
High correlation
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 824.7 KiB |
| Ladieswear | |
|---|---|
| Baby/Children | |
| Divided | |
| Menswear | |
| Sport | 3392 |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 10.157473 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ladieswear |
|---|---|
| 2nd row | Ladieswear |
| 3rd row | Ladieswear |
| 4th row | Ladieswear |
| 5th row | Ladieswear |
Common Values
| Value | Count | Frequency (%) |
| Ladieswear | 39737 | |
| Baby/Children | 34711 | |
| Divided | 15149 | 14.4% |
| Menswear | 12553 | 11.9% |
| Sport | 3392 | 3.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ladieswear | 39737 | |
| baby/children | 34711 | |
| divided | 15149 | 14.4% |
| menswear | 12553 | 11.9% |
| sport | 3392 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 154440 | |
| a | 126738 | |
| d | 104746 | 9.8% |
| i | 104746 | 9.8% |
| r | 90393 | 8.4% |
| w | 52290 | 4.9% |
| s | 52290 | 4.9% |
| n | 47264 | 4.4% |
| L | 39737 | 3.7% |
| b | 34711 | 3.2% |
| Other values (13) | 264685 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1072040 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 154440 | |
| a | 126738 | |
| d | 104746 | 9.8% |
| i | 104746 | 9.8% |
| r | 90393 | 8.4% |
| w | 52290 | 4.9% |
| s | 52290 | 4.9% |
| n | 47264 | 4.4% |
| L | 39737 | 3.7% |
| b | 34711 | 3.2% |
| Other values (13) | 264685 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1072040 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 154440 | |
| a | 126738 | |
| d | 104746 | 9.8% |
| i | 104746 | 9.8% |
| r | 90393 | 8.4% |
| w | 52290 | 4.9% |
| s | 52290 | 4.9% |
| n | 47264 | 4.4% |
| L | 39737 | 3.7% |
| b | 34711 | 3.2% |
| Other values (13) | 264685 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1072040 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 154440 | |
| a | 126738 | |
| d | 104746 | 9.8% |
| i | 104746 | 9.8% |
| r | 90393 | 8.4% |
| w | 52290 | 4.9% |
| s | 52290 | 4.9% |
| n | 47264 | 4.4% |
| L | 39737 | 3.7% |
| b | 34711 | 3.2% |
| Other values (13) | 264685 |
section_no
Real number (ℝ)
High correlation
| Distinct | 57 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.664219 |
| Minimum | 2 |
|---|---|
| Maximum | 97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 824.7 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 20 |
| median | 46 |
| Q3 | 61 |
| 95-th percentile | 77 |
| Maximum | 97 |
| Range | 95 |
| Interquartile range (IQR) | 41 |
Descriptive statistics
| Standard deviation | 23.260105 |
|---|---|
| Coefficient of variation (CV) | 0.54518999 |
| Kurtosis | -1.1000683 |
| Mean | 42.664219 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | -0.084535432 |
| Sum | 4502867 |
| Variance | 541.03248 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15 | 7295 | 6.9% |
| 53 | 7124 | 6.7% |
| 44 | 4932 | 4.7% |
| 76 | 4469 | 4.2% |
| 77 | 3899 | 3.7% |
| 61 | 3598 | 3.4% |
| 79 | 3490 | 3.3% |
| 11 | 3376 | 3.2% |
| 46 | 3328 | 3.2% |
| 66 | 3270 | 3.1% |
| Other values (47) | 60761 |
| Value | Count | Frequency (%) |
| 2 | 2337 | 2.2% |
| 4 | 3 | < 0.1% |
| 5 | 1894 | 1.8% |
| 6 | 2725 | 2.6% |
| 8 | 2266 | 2.1% |
| 11 | 3376 | |
| 14 | 1270 | 1.2% |
| 15 | 7295 | |
| 16 | 1581 | 1.5% |
| 17 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 97 | 559 | 0.5% |
| 82 | 682 | 0.6% |
| 80 | 35 | < 0.1% |
| 79 | 3490 | |
| 77 | 3899 | |
| 76 | 4469 | |
| 72 | 2034 | |
| 71 | 26 | < 0.1% |
| 70 | 280 | 0.3% |
| 66 | 3270 |
section_name
Text
| Distinct | 56 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 824.7 KiB |
Length
| Max length | 30 |
|---|---|
| Median length | 22 |
| Mean length | 16.743069 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Womens Everyday Basics |
|---|---|
| 2nd row | Womens Everyday Basics |
| 3rd row | Womens Everyday Basics |
| 4th row | Womens Lingerie |
| 5th row | Womens Lingerie |
| Value | Count | Frequency (%) |
| womens | 33662 | 12.8% |
| 17323 | 6.6% | |
| kids | 15153 | 5.8% |
| collection | 14419 | 5.5% |
| divided | 14275 | 5.4% |
| baby | 10551 | 4.0% |
| girl | 10128 | 3.9% |
| accessories | 9735 | 3.7% |
| everyday | 8876 | 3.4% |
| basics | 8828 | 3.4% |
| Other values (49) | 120028 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 182527 | 10.3% |
| 157436 | 8.9% | |
| s | 142303 | 8.1% |
| i | 130588 | 7.4% |
| o | 123340 | 7.0% |
| n | 99911 | 5.7% |
| r | 93569 | 5.3% |
| a | 92150 | 5.2% |
| l | 72523 | 4.1% |
| d | 67367 | 3.8% |
| Other values (38) | 605383 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1767097 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 182527 | 10.3% |
| 157436 | 8.9% | |
| s | 142303 | 8.1% |
| i | 130588 | 7.4% |
| o | 123340 | 7.0% |
| n | 99911 | 5.7% |
| r | 93569 | 5.3% |
| a | 92150 | 5.2% |
| l | 72523 | 4.1% |
| d | 67367 | 3.8% |
| Other values (38) | 605383 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1767097 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 182527 | 10.3% |
| 157436 | 8.9% | |
| s | 142303 | 8.1% |
| i | 130588 | 7.4% |
| o | 123340 | 7.0% |
| n | 99911 | 5.7% |
| r | 93569 | 5.3% |
| a | 92150 | 5.2% |
| l | 72523 | 4.1% |
| d | 67367 | 3.8% |
| Other values (38) | 605383 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1767097 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 182527 | 10.3% |
| 157436 | 8.9% | |
| s | 142303 | 8.1% |
| i | 130588 | 7.4% |
| o | 123340 | 7.0% |
| n | 99911 | 5.7% |
| r | 93569 | 5.3% |
| a | 92150 | 5.2% |
| l | 72523 | 4.1% |
| d | 67367 | 3.8% |
| Other values (38) | 605383 |
garment_group_no
Real number (ℝ)
High correlation
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1010.4383 |
| Minimum | 1001 |
|---|---|
| Maximum | 1025 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 824.7 KiB |
Quantile statistics
| Minimum | 1001 |
|---|---|
| 5-th percentile | 1002 |
| Q1 | 1005 |
| median | 1009 |
| Q3 | 1017 |
| 95-th percentile | 1020 |
| Maximum | 1025 |
| Range | 24 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 6.7310232 |
|---|---|
| Coefficient of variation (CV) | 0.0066614886 |
| Kurtosis | -1.287045 |
| Mean | 1010.4383 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.31875162 |
| Sum | 1.0664368 × 108 |
| Variance | 45.306673 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1005 | 21445 | |
| 1019 | 11519 | |
| 1002 | 8126 | 7.7% |
| 1003 | 7490 | 7.1% |
| 1017 | 7441 | 7.1% |
| 1009 | 6727 | 6.4% |
| 1010 | 5838 | 5.5% |
| 1020 | 5145 | 4.9% |
| 1013 | 4874 | 4.6% |
| 1007 | 4501 | 4.3% |
| Other values (11) | 22436 |
| Value | Count | Frequency (%) |
| 1001 | 3873 | 3.7% |
| 1002 | 8126 | 7.7% |
| 1003 | 7490 | 7.1% |
| 1005 | 21445 | |
| 1006 | 1965 | 1.9% |
| 1007 | 4501 | 4.3% |
| 1008 | 908 | 0.9% |
| 1009 | 6727 | 6.4% |
| 1010 | 5838 | 5.5% |
| 1011 | 2116 | 2.0% |
| Value | Count | Frequency (%) |
| 1025 | 1559 | 1.5% |
| 1023 | 1061 | 1.0% |
| 1021 | 2272 | 2.2% |
| 1020 | 5145 | |
| 1019 | 11519 | |
| 1018 | 2787 | 2.6% |
| 1017 | 7441 | |
| 1016 | 3100 | 2.9% |
| 1014 | 1541 | 1.5% |
| 1013 | 4874 |
garment_group_name
Categorical
High correlation
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 824.7 KiB |
| Jersey Fancy | |
|---|---|
| Accessories | |
| Jersey Basic | |
| Knitwear | |
| Under-, Nightwear | |
| Other values (16) |
Length
| Max length | 29 |
|---|---|
| Median length | 17 |
| Mean length | 10.951811 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Jersey Basic |
|---|---|
| 2nd row | Jersey Basic |
| 3rd row | Jersey Basic |
| 4th row | Under-, Nightwear |
| 5th row | Under-, Nightwear |
Common Values
| Value | Count | Frequency (%) |
| Jersey Fancy | 21445 | |
| Accessories | 11519 | |
| Jersey Basic | 8126 | 7.7% |
| Knitwear | 7490 | 7.1% |
| Under-, Nightwear | 7441 | 7.1% |
| Trousers | 6727 | 6.4% |
| Blouses | 5838 | 5.5% |
| Shoes | 5145 | 4.9% |
| Dresses Ladies | 4874 | 4.6% |
| Outdoor | 4501 | 4.3% |
| Other values (11) | 22436 |
Length
| Value | Count | Frequency (%) |
| jersey | 29571 | |
| fancy | 21445 | |
| accessories | 11519 | 7.1% |
| trousers | 9827 | 6.1% |
| basic | 8126 | 5.0% |
| knitwear | 7490 | 4.6% |
| under | 7441 | 4.6% |
| nightwear | 7441 | 4.6% |
| blouses | 5838 | 3.6% |
| shoes | 5145 | 3.2% |
| Other values (20) | 47761 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 160751 | |
| s | 150245 | |
| r | 108764 | 9.4% |
| i | 59052 | 5.1% |
| a | 57461 | 5.0% |
| n | 57297 | 5.0% |
| 56062 | 4.9% | |
| c | 55942 | 4.8% |
| y | 54946 | 4.8% |
| o | 51000 | 4.4% |
| Other values (30) | 344356 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1155876 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 160751 | |
| s | 150245 | |
| r | 108764 | 9.4% |
| i | 59052 | 5.1% |
| a | 57461 | 5.0% |
| n | 57297 | 5.0% |
| 56062 | 4.9% | |
| c | 55942 | 4.8% |
| y | 54946 | 4.8% |
| o | 51000 | 4.4% |
| Other values (30) | 344356 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1155876 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 160751 | |
| s | 150245 | |
| r | 108764 | 9.4% |
| i | 59052 | 5.1% |
| a | 57461 | 5.0% |
| n | 57297 | 5.0% |
| 56062 | 4.9% | |
| c | 55942 | 4.8% |
| y | 54946 | 4.8% |
| o | 51000 | 4.4% |
| Other values (30) | 344356 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1155876 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 160751 | |
| s | 150245 | |
| r | 108764 | 9.4% |
| i | 59052 | 5.1% |
| a | 57461 | 5.0% |
| n | 57297 | 5.0% |
| 56062 | 4.9% | |
| c | 55942 | 4.8% |
| y | 54946 | 4.8% |
| o | 51000 | 4.4% |
| Other values (30) | 344356 |
detail_desc
Text
| Distinct | 43404 |
|---|---|
| Distinct (%) | 41.3% |
| Missing | 416 |
| Missing (%) | 0.4% |
| Memory size | 824.7 KiB |
Length
| Max length | 764 |
|---|---|
| Median length | 468 |
| Mean length | 142.1619 |
| Min length | 11 |
Unique
| Unique | 21430 ? |
|---|---|
| Unique (%) | 20.4% |
Sample
| 1st row | Jersey top with narrow shoulder straps. |
|---|---|
| 2nd row | Jersey top with narrow shoulder straps. |
| 3rd row | Jersey top with narrow shoulder straps. |
| 4th row | Microfibre T-shirt bra with underwired, moulded, lightly padded cups that shape the bust and provide good support. Narrow adjustable shoulder straps and a narrow hook-and-eye fastening at the back. Without visible seams for greater comfort. |
| 5th row | Microfibre T-shirt bra with underwired, moulded, lightly padded cups that shape the bust and provide good support. Narrow adjustable shoulder straps and a narrow hook-and-eye fastening at the back. Without visible seams for greater comfort. |
| Value | Count | Frequency (%) |
| and | 160065 | 6.4% |
| a | 151693 | 6.0% |
| with | 150703 | 6.0% |
| the | 135045 | 5.4% |
| in | 105374 | 4.2% |
| at | 80688 | 3.2% |
| back | 36807 | 1.5% |
| front | 36244 | 1.4% |
| soft | 35579 | 1.4% |
| waist | 34284 | 1.4% |
| Other values (5000) | 1586260 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2407661 | ||
| e | 1318549 | 8.8% |
| t | 1241876 | 8.3% |
| a | 1029247 | 6.9% |
| n | 910904 | 6.1% |
| i | 876105 | 5.9% |
| s | 828095 | 5.5% |
| o | 718446 | 4.8% |
| r | 618196 | 4.1% |
| d | 602822 | 4.0% |
| Other values (88) | 4393011 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 14944912 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2407661 | ||
| e | 1318549 | 8.8% |
| t | 1241876 | 8.3% |
| a | 1029247 | 6.9% |
| n | 910904 | 6.1% |
| i | 876105 | 5.9% |
| s | 828095 | 5.5% |
| o | 718446 | 4.8% |
| r | 618196 | 4.1% |
| d | 602822 | 4.0% |
| Other values (88) | 4393011 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 14944912 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2407661 | ||
| e | 1318549 | 8.8% |
| t | 1241876 | 8.3% |
| a | 1029247 | 6.9% |
| n | 910904 | 6.1% |
| i | 876105 | 5.9% |
| s | 828095 | 5.5% |
| o | 718446 | 4.8% |
| r | 618196 | 4.1% |
| d | 602822 | 4.0% |
| Other values (88) | 4393011 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 14944912 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2407661 | ||
| e | 1318549 | 8.8% |
| t | 1241876 | 8.3% |
| a | 1029247 | 6.9% |
| n | 910904 | 6.1% |
| i | 876105 | 5.9% |
| s | 828095 | 5.5% |
| o | 718446 | 4.8% |
| r | 618196 | 4.1% |
| d | 602822 | 4.0% |
| Other values (88) | 4393011 |
Interactions
Correlations
| article_id | colour_group_code | colour_group_name | department_no | garment_group_name | garment_group_no | graphical_appearance_name | graphical_appearance_no | index_code | index_group_name | index_group_no | index_name | perceived_colour_master_id | perceived_colour_master_name | perceived_colour_value_id | perceived_colour_value_name | product_code | product_group_name | product_type_no | section_no | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| article_id | 1.000 | -0.042 | 0.072 | -0.073 | 0.091 | 0.010 | 0.063 | -0.000 | 0.070 | 0.083 | 0.083 | 0.070 | 0.029 | 0.063 | -0.054 | 0.032 | 1.000 | 0.076 | -0.040 | -0.042 |
| colour_group_code | -0.042 | 1.000 | 1.000 | 0.080 | 0.135 | -0.017 | 0.165 | 0.050 | 0.120 | 0.143 | 0.143 | 0.120 | -0.339 | 0.930 | 0.009 | 0.262 | -0.042 | 0.097 | 0.077 | -0.004 |
| colour_group_name | 0.072 | 1.000 | 1.000 | 0.176 | 0.157 | 0.160 | 0.194 | 0.734 | 0.216 | 0.212 | 0.212 | 0.216 | 0.844 | 0.845 | 0.814 | 0.814 | 0.072 | 0.128 | 0.141 | 0.172 |
| department_no | -0.073 | 0.080 | 0.176 | 1.000 | 0.438 | -0.054 | 0.161 | -0.097 | 0.660 | 0.650 | 0.650 | 0.660 | -0.041 | 0.156 | 0.007 | 0.119 | -0.073 | 0.333 | -0.011 | 0.314 |
| garment_group_name | 0.091 | 0.135 | 0.157 | 0.438 | 1.000 | 1.000 | 0.244 | 0.052 | 0.458 | 0.335 | 0.335 | 0.458 | 0.164 | 0.132 | 0.139 | 0.139 | 0.091 | 0.540 | 0.439 | 0.401 |
| garment_group_no | 0.010 | -0.017 | 0.160 | -0.054 | 1.000 | 1.000 | 0.255 | 0.058 | 0.356 | 0.214 | 0.214 | 0.356 | -0.024 | 0.138 | 0.027 | 0.089 | 0.010 | 0.540 | -0.053 | 0.182 |
| graphical_appearance_name | 0.063 | 0.165 | 0.194 | 0.161 | 0.244 | 0.255 | 1.000 | 1.000 | 0.189 | 0.213 | 0.213 | 0.189 | 0.181 | 0.144 | 0.305 | 0.305 | 0.063 | 0.173 | 0.115 | 0.189 |
| graphical_appearance_no | -0.000 | 0.050 | 0.734 | -0.097 | 0.052 | 0.058 | 1.000 | 1.000 | 0.018 | 0.014 | 0.014 | 0.018 | -0.096 | 0.147 | 0.019 | 0.734 | -0.000 | 0.000 | 0.013 | -0.026 |
| index_code | 0.070 | 0.120 | 0.216 | 0.660 | 0.458 | 0.356 | 0.189 | 0.018 | 1.000 | 1.000 | 1.000 | 1.000 | 0.155 | 0.187 | 0.118 | 0.118 | 0.070 | 0.376 | 0.311 | 0.642 |
| index_group_name | 0.083 | 0.143 | 0.212 | 0.650 | 0.335 | 0.214 | 0.213 | 0.014 | 1.000 | 1.000 | 1.000 | 1.000 | 0.134 | 0.182 | 0.102 | 0.102 | 0.083 | 0.159 | 0.077 | 0.764 |
| index_group_no | 0.083 | 0.143 | 0.212 | 0.650 | 0.335 | 0.214 | 0.213 | 0.014 | 1.000 | 1.000 | 1.000 | 1.000 | 0.134 | 0.182 | 0.102 | 0.102 | 0.083 | 0.159 | 0.077 | 0.764 |
| index_name | 0.070 | 0.120 | 0.216 | 0.660 | 0.458 | 0.356 | 0.189 | 0.018 | 1.000 | 1.000 | 1.000 | 1.000 | 0.155 | 0.187 | 0.118 | 0.118 | 0.070 | 0.376 | 0.311 | 0.642 |
| perceived_colour_master_id | 0.029 | -0.339 | 0.844 | -0.041 | 0.164 | -0.024 | 0.181 | -0.096 | 0.155 | 0.134 | 0.134 | 0.155 | 1.000 | 1.000 | -0.038 | 0.355 | 0.029 | 0.139 | -0.087 | 0.000 |
| perceived_colour_master_name | 0.063 | 0.930 | 0.845 | 0.156 | 0.132 | 0.138 | 0.144 | 0.147 | 0.187 | 0.182 | 0.182 | 0.187 | 1.000 | 1.000 | 0.594 | 0.594 | 0.063 | 0.113 | 0.131 | 0.147 |
| perceived_colour_value_id | -0.054 | 0.009 | 0.814 | 0.007 | 0.139 | 0.027 | 0.305 | 0.019 | 0.118 | 0.102 | 0.102 | 0.118 | -0.038 | 0.594 | 1.000 | 1.000 | -0.054 | 0.102 | -0.029 | -0.005 |
| perceived_colour_value_name | 0.032 | 0.262 | 0.814 | 0.119 | 0.139 | 0.089 | 0.305 | 0.734 | 0.118 | 0.102 | 0.102 | 0.118 | 0.355 | 0.594 | 1.000 | 1.000 | 0.032 | 0.102 | 0.060 | 0.086 |
| product_code | 1.000 | -0.042 | 0.072 | -0.073 | 0.091 | 0.010 | 0.063 | -0.000 | 0.070 | 0.083 | 0.083 | 0.070 | 0.029 | 0.063 | -0.054 | 0.032 | 1.000 | 0.076 | -0.040 | -0.042 |
| product_group_name | 0.076 | 0.097 | 0.128 | 0.333 | 0.540 | 0.540 | 0.173 | 0.000 | 0.376 | 0.159 | 0.159 | 0.376 | 0.139 | 0.113 | 0.102 | 0.102 | 0.076 | 1.000 | 0.711 | 0.271 |
| product_type_no | -0.040 | 0.077 | 0.141 | -0.011 | 0.439 | -0.053 | 0.115 | 0.013 | 0.311 | 0.077 | 0.077 | 0.311 | -0.087 | 0.131 | -0.029 | 0.060 | -0.040 | 0.711 | 1.000 | 0.027 |
| section_no | -0.042 | -0.004 | 0.172 | 0.314 | 0.401 | 0.182 | 0.189 | -0.026 | 0.642 | 0.764 | 0.764 | 0.642 | 0.000 | 0.147 | -0.005 | 0.086 | -0.042 | 0.271 | 0.027 | 1.000 |
Missing values
Sample
| article_id | product_code | prod_name | product_type_no | product_type_name | product_group_name | graphical_appearance_no | graphical_appearance_name | colour_group_code | colour_group_name | perceived_colour_value_id | perceived_colour_value_name | perceived_colour_master_id | perceived_colour_master_name | department_no | department_name | index_code | index_name | index_group_no | index_group_name | section_no | section_name | garment_group_no | garment_group_name | detail_desc | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 108775015 | 108775 | Strap top | 253 | Vest top | Garment Upper body | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 1676 | Jersey Basic | A | Ladieswear | 1 | Ladieswear | 16 | Womens Everyday Basics | 1002 | Jersey Basic | Jersey top with narrow shoulder straps. |
| 1 | 108775044 | 108775 | Strap top | 253 | Vest top | Garment Upper body | 1010016 | Solid | 10 | White | 3 | Light | 9 | White | 1676 | Jersey Basic | A | Ladieswear | 1 | Ladieswear | 16 | Womens Everyday Basics | 1002 | Jersey Basic | Jersey top with narrow shoulder straps. |
| 2 | 108775051 | 108775 | Strap top (1) | 253 | Vest top | Garment Upper body | 1010017 | Stripe | 11 | Off White | 1 | Dusty Light | 9 | White | 1676 | Jersey Basic | A | Ladieswear | 1 | Ladieswear | 16 | Womens Everyday Basics | 1002 | Jersey Basic | Jersey top with narrow shoulder straps. |
| 3 | 110065001 | 110065 | OP T-shirt (Idro) | 306 | Bra | Underwear | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 1339 | Clean Lingerie | B | Lingeries/Tights | 1 | Ladieswear | 61 | Womens Lingerie | 1017 | Under-, Nightwear | Microfibre T-shirt bra with underwired, moulded, lightly padded cups that shape the bust and provide good support. Narrow adjustable shoulder straps and a narrow hook-and-eye fastening at the back. Without visible seams for greater comfort. |
| 4 | 110065002 | 110065 | OP T-shirt (Idro) | 306 | Bra | Underwear | 1010016 | Solid | 10 | White | 3 | Light | 9 | White | 1339 | Clean Lingerie | B | Lingeries/Tights | 1 | Ladieswear | 61 | Womens Lingerie | 1017 | Under-, Nightwear | Microfibre T-shirt bra with underwired, moulded, lightly padded cups that shape the bust and provide good support. Narrow adjustable shoulder straps and a narrow hook-and-eye fastening at the back. Without visible seams for greater comfort. |
| 5 | 110065011 | 110065 | OP T-shirt (Idro) | 306 | Bra | Underwear | 1010016 | Solid | 12 | Light Beige | 1 | Dusty Light | 11 | Beige | 1339 | Clean Lingerie | B | Lingeries/Tights | 1 | Ladieswear | 61 | Womens Lingerie | 1017 | Under-, Nightwear | Microfibre T-shirt bra with underwired, moulded, lightly padded cups that shape the bust and provide good support. Narrow adjustable shoulder straps and a narrow hook-and-eye fastening at the back. Without visible seams for greater comfort. |
| 6 | 111565001 | 111565 | 20 den 1p Stockings | 304 | Underwear Tights | Socks & Tights | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 3608 | Tights basic | B | Lingeries/Tights | 1 | Ladieswear | 62 | Womens Nightwear, Socks & Tigh | 1021 | Socks and Tights | Semi shiny nylon stockings with a wide, reinforced trim at the top. Use with a suspender belt. 20 denier. |
| 7 | 111565003 | 111565 | 20 den 1p Stockings | 302 | Socks | Socks & Tights | 1010016 | Solid | 13 | Beige | 2 | Medium Dusty | 11 | Beige | 3608 | Tights basic | B | Lingeries/Tights | 1 | Ladieswear | 62 | Womens Nightwear, Socks & Tigh | 1021 | Socks and Tights | Semi shiny nylon stockings with a wide, reinforced trim at the top. Use with a suspender belt. 20 denier. |
| 8 | 111586001 | 111586 | Shape Up 30 den 1p Tights | 273 | Leggings/Tights | Garment Lower body | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 3608 | Tights basic | B | Lingeries/Tights | 1 | Ladieswear | 62 | Womens Nightwear, Socks & Tigh | 1021 | Socks and Tights | Tights with built-in support to lift the bottom. Black in 30 denier and light amber in 15 denier. |
| 9 | 111593001 | 111593 | Support 40 den 1p Tights | 304 | Underwear Tights | Socks & Tights | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 3608 | Tights basic | B | Lingeries/Tights | 1 | Ladieswear | 62 | Womens Nightwear, Socks & Tigh | 1021 | Socks and Tights | Semi shiny tights that shape the tummy, thighs and calves while also encouraging blood circulation in the legs. Elasticated waist. |
| article_id | product_code | prod_name | product_type_no | product_type_name | product_group_name | graphical_appearance_no | graphical_appearance_name | colour_group_code | colour_group_name | perceived_colour_value_id | perceived_colour_value_name | perceived_colour_master_id | perceived_colour_master_name | department_no | department_name | index_code | index_name | index_group_no | index_group_name | section_no | section_name | garment_group_no | garment_group_name | detail_desc | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 105532 | 949594001 | 949594 | LOGG Elvis jogger. | 272 | Trousers | Garment Lower body | 1010016 | Solid | 8 | Dark Grey | 4 | Dark | -1 | Unknown | 1919 | Jersey | A | Ladieswear | 1 | Ladieswear | 2 | H&M+ | 1005 | Jersey Fancy | Joggers in soft sweatshirt fabric with an elasticated, drawstring waist, diagonal side pockets and slim legs with ribbed hems. |
| 105533 | 950449002 | 950449 | Compact brush Fancy | 78 | Other accessories | Accessories | 1010016 | Solid | 50 | Other Pink | 5 | Bright | 4 | Pink | 4313 | Girls Small Acc/Bags | J | Children Accessories, Swimwear | 4 | Baby/Children | 43 | Kids Accessories, Swimwear & D | 1019 | Accessories | Small, folding hair brush with a rhinestone-decorated lid that has a mirror inside. Diameter 6.5 cm. |
| 105534 | 952267001 | 952267 | Heavy plain overknee tights 1p | 304 | Underwear Tights | Socks & Tights | 1010013 | Other pattern | 9 | Black | 4 | Dark | 5 | Black | 3608 | Tights basic | B | Lingeries/Tights | 1 | Ladieswear | 62 | Womens Nightwear, Socks & Tigh | 1021 | Socks and Tights | Fine-knit tights with an elasticated waist that are thinner at the top and more opaque at the bottom giving them the appearance of over-the-knee socks. |
| 105535 | 952937003 | 952937 | Jets dress | 265 | Dress | Garment Full body | 1010001 | All over pattern | 13 | Beige | 2 | Medium Dusty | 1 | Mole | 1641 | Jersey | A | Ladieswear | 1 | Ladieswear | 18 | Womens Trend | 1005 | Jersey Fancy | Fitted, calf-length dress in viscose jersey with a stand-up collar and concealed zip at the back. Double layer at the top with wrapover, draped sections, close-fitting, extra-long sleeves and an asymmetric skirt with a high slit in one side. Lined. |
| 105536 | 952938001 | 952938 | Elton top | 254 | Top | Garment Upper body | 1010001 | All over pattern | 13 | Beige | 2 | Medium Dusty | 1 | Mole | 1641 | Jersey | A | Ladieswear | 1 | Ladieswear | 18 | Womens Trend | 1005 | Jersey Fancy | Fitted top in jersey with a round neckline and extra-long sleeves. Additional draped layer at the front. |
| 105537 | 953450001 | 953450 | 5pk regular Placement1 | 302 | Socks | Socks & Tights | 1010014 | Placement print | 9 | Black | 4 | Dark | 5 | Black | 7188 | Socks Bin | F | Menswear | 3 | Menswear | 26 | Men Underwear | 1021 | Socks and Tights | Socks in a fine-knit cotton blend with a small motif at the top and elasticated tops. |
| 105538 | 953763001 | 953763 | SPORT Malaga tank | 253 | Vest top | Garment Upper body | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 1919 | Jersey | A | Ladieswear | 1 | Ladieswear | 2 | H&M+ | 1005 | Jersey Fancy | Loose-fitting sports vest top in ribbed fast-drying functional fabric made from recycled polyester with a racer back and rounded hem. |
| 105539 | 956217002 | 956217 | Cartwheel dress | 265 | Dress | Garment Full body | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 1641 | Jersey | A | Ladieswear | 1 | Ladieswear | 18 | Womens Trend | 1005 | Jersey Fancy | Short, A-line dress in jersey with a round neckline and V-shaped opening at the front with narrow ties. Long, voluminous raglan sleeves and wide cuffs with covered buttons. |
| 105540 | 957375001 | 957375 | CLAIRE HAIR CLAW | 72 | Hair clip | Accessories | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 3946 | Small Accessories | D | Divided | 2 | Divided | 52 | Divided Accessories | 1019 | Accessories | Large plastic hair claw. |
| 105541 | 959461001 | 959461 | Lounge dress | 265 | Dress | Garment Full body | 1010016 | Solid | 11 | Off White | 1 | Dusty Light | 9 | White | 1641 | Jersey | A | Ladieswear | 1 | Ladieswear | 18 | Womens Trend | 1005 | Jersey Fancy | Calf-length dress in ribbed jersey made from a cotton blend. Low-cut V-neck at the back, dropped shoulders and long, wide sleeves that taper to the cuffs. Unlined. |